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A HUMAN ENDOTHELIN RECEPTOR 



BACKGROUND OF THE INVENTION 



1. Field of the Invention: 

The present invention relates to a human 
endothelin receptor, DNA sequence encoding the recep- 
tor, an expression vector carrying the DNA sequence, a 
transf ormant comprising the expression vector, and a 
method for producing a human endothelin receptor from 
the transf ormant . 

2. Description of the Prior Art: 

An endothelin receptor ( ET-receptor ) is a 
receptor for an endothelin (ET). ET-receptors derived 
from animals such as bovines and rats have been known. 
An ET is a peptide present in various tissues in ani- 
mals and is known as a strong vasoconstrictor. Cloning 
and sequence analysis of known ET genes have revealed 
that the ETs comprise three kinds of isopeptides: 
Endothelin 1 (ET-i), Endothelin 2 (ET-2), and Endothe- 
lin 3 (ET-3). Thereafter, it has been found that these 
ETs are disti ibuted in a wide variety of vascular and 
non- vascular tissues (Proc. Natl* Acad. Sci. U.S.A. 86 , 
2863-2867 (1989); Trends in Pharmacol. Sci. 10, 374-378 

(1989) ; and Proc. Natl. Acad. Sci. U.S.A. 87, 2359-2363 

(1990) ). ET-1 has initially been identified as c 
strong vasoconstrictive peptide with 21-amino-acid 
residues produced by porcine vascular endothelial cells 
(Nature, 332 , 411-415 (1988)). 



It has previously been shown in vivo that 
ET-1 and ET-2 are much more strong vasoconstrictors 
than ET-3, whereas the three ET isopeptides are roughly 
equipotent in producing the transient vasodilation. 



As described above, the analysis of nucleic 
acid sequences of ETs has revealed that various kinds 
of ET isopeptides exist. These ET isopeptides are also 
different in their properties. Therefore, it appears 
that various subtypes of ET-receptors exist. The 
existence of various subtypes of ET-receptors has been 
proved by the radioactive ligand binding studies of 
Watanabe, H. et al. (Biochem. Biophys. Res. Commun., 
161 , 1252-1259 (1989)), and Martin, E. R. et al. 
(J. Biol. Chem. 265, 14044-14049 (1990)). These stud- 
ies indicate the existence of, at least, two kinds of 
ET-receptors. One of them has a higher affinity for 
ET-1 and ET-2 than for ET-3; and the other has an 
affinity for ET-1, ET-2, and ET-3 with no selectivity. 

The ET-receptor is useful as a reagent for 
measuring the amount of ET or useful in screening for 
an antagonist of the ET-receptor so as to study agents 
for the circulatory system. Therefore, there is a 
demand for a structure analysis of the ET-receptor and 
effective production of the ET-recepto- by means of 
genetic engineering using the information of this 
structural analysis. 

SUMMARY OF THE INVENTION 

The human endothelin receptor of the present 
invention comprises amino acid sequence from Asp at +1 
to Asn at +407 shown in SEQ ID NO: 1. 

The human endothelin receptor of the present 
invention comprises amino acid sequence from Met at 
-20 to Asn at +407 shown in SEQ ID NO: 1. 
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The DNA sequence of the present invention 
encodes the human endothelin receptor comprising amino 
acid sequence from Asp at +1 to Asn at +407 shown in 
SEQ ID NO: 1. 

5 

The human endothelin receptor of the present 
invention comprises amino acid sequence from Glu at 
+27 to Ser at +442 shown in SEQ ID NO: 2, 

10 The human endothelin receptor of the present 

invention comprises amino acid sequence from Met at +1 
to Ser at +442 shown in SEQ ID NO: 2. 

The DNA sequence of the present invention 
15 encodes the human endothelin receptor comprising amino 
acid sequence from Glu at +27 to Ser at +442 shown in 
SEQ ID NO: 2. 

The expression vector of the present inven- 
20 tion comprises the DNA sequence encoding the human 
endothelin receptor having amino acid sequence from 
Asp at +1 to Asn at +407 shown in SEQ ID NO: 1. 

The transformant of the present invention is 
25 obtained by introducing into a host cell the expression 
vector comprising the DNA sequence encoding the human 
endothelin receptor having amino acid sequence from Asp 
at +1 to Asn at +407 shown in SEQ ID NO: 1. 

30 The expression vector of the present inven- 

tion comprises the DNA sequence encoding the human 
endothelin receptor having amino acid sequence from Glu 
at +27 to Ser at +442 shown in SEQ ID NO: 2. 



The transformant of the present invention is 
obtained by introducing into a host cell the expression 
vector comprising the DNA sequence encoding the human 
endothelin receptor having amino acid sequence from 
Glu at +27 to Ser at +442 shown in SEQ ID NO: 2. 

The method for producing a human endothelin 
receptor of the present invention comprises culturing 
either one of the above-mentioned transf ormants and 
recovering a produced endothelin receptor. 

Thus, the invention described herein makes 
possible the advantage of providing a human ET-recep- 
tor, DNA sequence encoding the ET-receptor, an expres- 
sion vector carrying the DNA sequence, a transformant 
comprising the expression vector , and a method for 
producing an ET-receptor from the transformant* 

These and other advantages of the present 
invention will become apparent to those skilled in the 
art upon reading and understanding the following de- 
tailed description with reference to the accompanying 
figures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows DNA coding sequence and de- 
duced amino acid sequence of an ET A -receptor according 
-to the present: invention, » 

Figure 2 shows DNA coding sequence and 
deduced amino acid sequence of an ET B -receptor accord- 
ing to the present invention. 



Figure 3 is a graph showing the results of a 
binding assay for determining the binding properties of 
the ET A -receptor to ET-1, ET-2, or ET-3. 

Figure 4 is a graph showing the results of a 
binding assay for determining the binding properties of 
the ET B -receptor to ET-1, ET-2, or ET-3. 

Figure 5 is a chart recording currents, which 
are generated at the time that ET-1 or ET-2 is applied 
to an oocyte of an Xenopus laevis injected with mRNA of 
the ET A -receptor according to the present invention. 

Figure 6 is a chart of autoradiography show- 
ing the results of hybridization of mRNAs isolated from 
a human tissue with a cDNA fragment of the ET A -receptor 
according to the present invention. 

Figure 7 is a chart of autoradiography show- 
ing the results of hybridization of mRNA isolated from 
a human tissue with a cDNA fragment of the ET B -receptor 
according to the present invention. 

Figure 8 is a restriction map of DNA sequence 
of the ET B -receptor according to the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The inventors succeeded in isolating a human 
ET-receptor cDNA from a cDNA library constructed from 
poly(A) + RNA derived from a human placenta, thereby 
achieving the present invention. 



The present invention will be described below 
in order of the steps involved. 

(I) Sequencing of DNA encoding a human ET-receptor: 

First, cDNA prepared from poly(A) + RNA derived 
from a human placenta, by using oligo( dT ) -primer , is 
introduced into phageX ZAPII to construct a cDNA 
library (Sambrook et al.. Molecular Cloning: A Labora- 
tory Manual, Cold Spring Harbor Laboratory, New York 
(1989))* Then, the cDNA library is screened with the 
use of DNA fragment of a known ET-receptor as a probe. 
For example, the cDNA library is screened by hybridiz- 
ing a probe, an NcoI-EcoRI fragment (9 60 bp) of DNA 
encoding a bovine ET-1 receptor, with the cDNA library 
to obtain positive plasmid clone phETIR. In addition, 
the cDNA library is hybridized under less stringent 
conditions to obtain pHETBRl, pHETBR20, pHETBRo 1 and 
pHETBR34. cDNA inserts contained in these clones are 
cut with appropriate restriction enzymes and subcloned, 
after which these cDNA inserts are sequenced by the 
dideoxy chain termination method. The nucleic acid 
sequence of the human ET-receptor thus obtained from 
phETIR and amino acid sequence corresponding thereto 
are shown in SEQ ID NO: 1 in a Sequence Listing. The 
nucleic acid sequence of the human ET-receptor thus 
obtained from pHETBR3 1 and pHETBR34 and amino acid 
sequence corresponding thereto are shown in SEQ ID 
NO: 2 in Sequence Listing. A restriction map of the 
nucleic acid sequence in SEQ ID NO: 2 is shown in 
Figure 8. The positions of 3 1 termini of the inserts 
contained in pHETBR3 1 and pHETBRl are respectively 
marked with a double line and a wave line in the se- 
quence of Figure 2. 



The ET-receptor encoded by DNA shown in SEQ 
ID NO: 1 is a receptor having an affinity for ET-1 and 
ET-2 ( ET A -receptor ) • The ET-receptor encoded by DNA 
shown in SEQ ID NO: 2 is a receptor having an affinity 
(with no selectivity) for both ET-1, ET-2, and ET-3 
( ET B -receptor ) . 

(1) DNA sequence of an ET-receptor ( ET A -receptor ) 
from phETIR 

As shown m SEQ ID NO: 1 and Figure 1, cDNA 
contained in the above-mentioned plasmid clone phETIR 
has a sequence comprising 4,105 nucleic acids. In this 
nucleic acid sequence, an open reading frame from A at 
485 to A at 1768 are constituted, which encodes a 
427-amino-acid protein with a molecular weight of 
48,726. A sequence adjacent to the initiation codon of 
the open reading frame is quite consistent with a con- 
sensus sequence. A peptide consisting of amino acids 
from Met corresponding to the initiation codon to the 
20th amino acid from Met may be a signal sequence. A 
3'-noncoding region contains ATTTA sequence (underlined 
in the noncoding region of the sequences in Figure 1), 
which are related with instability of mRNA. There is a 
potential polyadenylation signal 22-nucleotides up- 
stream of the poly(A) + tail (broken underlined in 
Figure 1 ) . Hydropathicity analysis of the amino acids 
constituting the protein encoded by this cDNA indicates 
that there are seven hydrophobic clusters of 22-26 
residues in the protein, each being separated by hydro- 
philic amino acid sequences. As described above, the 
protein has seven transmembrane domains, and these 
domains have an extracellular N tail and a cytoplasmic 
C tail. The characteristics of this protein are con- 



sistent with those of the superfamily of G protein- 
coupled receptors. These seven transmembrane domains 
are shown as I to VII in the sequences of Figure 1. 

In the above-mentioned cDNA, there are sever- 
al potential sites for post-translational modification, 
and these sites are identical to those of the bovine 
ET-1 receptor. They include two consensus sequences 
for N-glycosylation, Asn at 9 and 42 (shown by reverse 
triangles in Figure 1); six cysteine residues present 
on the N terminus side of the cytoplasmic C tail (359, 
363, and 365 to 368), one of which may be palmitoylated 
as in the £ 2 -adrenergic receptor; and serine residues 
that can be phosphorylated with serine/ threonine 
kinases (shown by solid circles in Figure 1). 

The nucleic acid sequence of the open reading 
frame of cDNA obtained from phETIR is 91.2% homologous 
to that of bovine ET-1 receptor cDNA. 

(2) DNA sequence of an ET-receptor ( ET B -receptor ) 
derived from pHETBR31 and pHETBR34 

As shown in SEQ ID NO: 2 and Figure 2, cDNA 
obtained from the above two plasmid clones has a se- 
quence comprising 4,301 nucleic acids. In this nucleic 
acid sequence, an open reading frame from A at 238 to A 
at 1566 exists, which encodes a 442-amino acid protein 
with a molecular weight of 49,629. A sequence adjacent 
to the initiation codon of the open reading frame is 
quite consistent with a consensus sequence. A peptide 
consisting of amino acids from Met corresponding to the 
initiation codon to the 26th amino acid from Met may be 
a signal sequence. In the same way as in the DNA 
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sequence of the ET^- receptor derived from -the above- 
mentioned phETIR, an ATTTA sequence, seven transmem- 
brane domains (I to VII), a polyadenylation signal, 
N-glycosylation sites, and serine residues that can be 
5 phosphorylated with serine/ threonine kinases are shown 
in the sequences of Figure 2. 

Recently, Sakurai et al. cloned cDNA encoding 
the ET-receptor of an ET B type from a rat lung (Nature, 
348 , 732-735 (1990)). The amino acid sequence of 
ETg-receptor from a rat is 88% homologous to that of 
the ET-receptor shown in SEQ ID NO: 2, and is 51.9% 
homologous to that of the ET-receptor shown in SEQ ID 
NO: 1. 

The amino acid sequence of the ET A -receptor 
shown in SEQ ID NO: 1 is 55% homologous to that of the 
ET B -receptor shown in SEQ ID NO: 2. The open reading 
frame of the DNA sequence encoding the ETg-receptor 
shown in SEQ ID NO: 2 is 61.1% homologous to that of 
the bovine ET A - receptor . 

(II) Construction of an expression vector, a prepara- 
tion of a transf ormant , and an expression of an 
25 ET-receptor: 

cDNAs encoding the above-mentioned ET-recep- 
tors are introduced into appropriate vectors to con- 
struct expression vectors . For example, a Not I frag- 
ment of the phETIR can be introduced into CDM8 (Nature, 
30 329 , 840-842 (1987)), to obtain an expression vector 
CDM 8 -phETIR . In the same way, an Xbal fragment of 
pHETBR3 4 can be introduced into CDM8 to obtain an 
expression vector CDM8-pHETBR. These expression vec- 
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tors can be introduced into appropriate host cells to 
obtain transf ormants . For example, a transf ormant 
capable of producing an ET -receptor can be obtained by 
transfecting one of the above-mentioned expression vec- 

5 tors into a COS-7 cell. An ET-receptor is produced by 

culturing the transformed COS-7 cell under normal 
conditions* The ET-receptor is expressed (produced) on 
the cell surface. The produced ET-receptor can be 
purified by, for example, combinations of various kinds 

10 of chromatographies. 

The ET-receptor thus produced from a trans- 
f ormant is subjected to a binding assay by the use of 
known ETs and is confirmed to be an ET-receptor. In 
15 addition, it is confirmed which endothelin: ET-1, ET-2, 
or ET-3 the ET-receptor is specifically bound to. 

For example, first, a predetermined amount of 
ET-receptor produced by the COS cell transformed with 

20 "the CDM8-phETIR is added to a mixture of a predeter- 
mined amount of ET-1 labeled with 12o i ( 125 i-ET~l) and 
unlabeled ET-1 and to allow to react. Then, the amount 
of labeled binding complex thus produced is measured. 
In Figure 3, the amount of unlabeled ET-1 is plotted on 

25 a horizontal axis by changing the concentration thereof 
in the range of 10 -1 ^ to 10"^ M, and the radioactivity 
of an ET-ET-receptor complex (radioactivity of the ET 
bound to the transformed cell) is plotted on a vertical 
axis (represented by the symbol $ ). Results ob- 

30 tained by performing a competitive assay using unla- 
beled ET-2 or ET-3 instead of unlabeled ET-1 in the 
same way as the above are also shown in Figure 3 
(represented by the symbols 61 (ET-2) and Jtk (ET-3)). 
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The COS-7 cell obtained by trans feeding the CDM8, which 
is a control plasmid, is cultured and is tested in the 

IOC 

same way as the above. The binding amount of A ^°I-ET-1 
is the same level as the amount of non-specific 
125j_ E rp_ -|_ measured in the presence of an excessive 
amount of unlabeled ET-1 (the results are not shown). 
These results indicate that the affinity of the 
ET-receptor from phETIR according to the present inven- 
tion for the ET is ET-1 ( 1C 50 3.0 x 1CT 9 M) ^ ET-2 
(IC 50 6.1 x 10" 9 M) >> ET-3 (IC 50 1.0 x 10" 6 M or 
more), suggesting that this ET-receptor is the 
ET A -receptor . 

The same procedure of binding assay as de- 
scribed above is done for the ET-receptor produced from 
the COS cell transformed with the CDM8-pHETBR. The 
results are shown in Figure 4 ( represented by the sym- 
bols $ (ET-1), 0 (ET-2), and (ET-3)). IC 5Q is 
about 1.0 x 10"~ 9 M, suggesting that this ET-receptor is 
the ET B -receptor . 

(Ill) Expression of ET-receptor mRNA in a cell: 

mRNA is synthesized from the cDNA of the 
ET-receptor of the present invention. When the synthe- 
sized mRNA is injected into an appropriate cell, for 
example, an oocyte of an Xenopus laeyis, an ET-receptor 
is expressed in the cell membrane. For example, mRNA 
is synthesized from cDNA shown in SEQ ID NO: 1 obtained 
in item (I) with the use of T7RNA polymerase. The 
synthesized mRNA is injected into an oocyte of an 
Xenopus laevis ; as a result, an ET^-receptor is pro- 
duced in the cell memblene. The production of an 
ET^- receptor is confirmed by the following procedure . 
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First, the membrane potential of the oocyte injected 
with mRNA is held at a predetermined value, and then 
this oocyte is brought into contact with a solution 
containing ET-1. If the ET A -receptor of the present 
invention is produced, it is expressed on the cell 
surface, thus bound to ET-1 present outside the cell . 
When the ET A -receptor is bound to ET-1, a current flows 
toward the inside of the cell. Therefore, the produc- 
tion of the ET-receptor of the present invention is 
confirmed by measuring this current . When the oocyte 
was brought into contact with a solution containing 
10" 7 M ET-1, a current of a large value is confirmed to 
flow toward the inside of the cell. When the oocyte 
was brought into contact with a solution containing 
10~ 7 M ET-2 instead of ET-1, the same value of current 
is confirmed to flow* In contrast, when the oocyte is 
brought into contact with a solution containing ET-3, 
only a small value of current is confirmed to flow. 
The comparison in current values between ET-1 and ET-3 
is shown in Figure 5, From this result, the 
ET A -receptor of the present invention has a higher 
affinity for ET-1 than for ET-3. 

(IV) Presence of ET-receptor mRNA in various human 
tissues: 

( 1 ) Presence of ET A - receptor mRNA 

Northern blot hybridization analysis is 
conducted on mRNA isolated from various human tissues 
by using, as a probe, DNA fragment encoding the 
ET A -receptor of the present invention (EcoRV-EcoRI 
fragment from phETIR; nucleic acids 739-1564, 826 bp) 
which is radio- labeled, resulting in a single positive 
band with a size of 4.3 kb. The results are shown in 
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Figure 6, The ET A -receptor mRNA of the present inven- 
tion is present in the aorta at the highest levels; in 
the lung, atrium, colon and placenta at high levels; 
and in the cerebral cortex, cerebellum., ventricle, 
kidney, adrenal and duodenum at moderate levels. A 
hybridized band is not found in the liver or in the 
cultured human umbilical vein endothelial cell. 

As described above, the ET A -receptor mRNA is 
present in the circulatory system, especially in the 
aorta at the highest levels. Since the ET-receptor 
mRNA is not present in the endthelial cell, the 
ET A -receptor mRNA is possibly expressed in the vascular 
smooth muscle cell. Martin et al . describes in 
J. Biol. Chem. 265 , 14044-14049 (1990) that ET-1 and 
ET-2 inhibit the binding of 125 1-ET-1 to a rat A-10 
vascular smooth muscle cell. This result is consistent 
with the experimental results that the ET A -receptor of 
the present invention is present in the vascular smooth 
muscle cell. The ET A ~receptor of the present invention 
appears to be a main subtype of the ET-receptor which 
is expressed in the vascular smooth muscle cell. 

In general , it is known that the concentra- 
tion of ET-1 in plasma is increased due to various 
diseases such as essential hypertension, vasospastic 
stenocardia, acute myocardial infarction, chronic renal 
insufficiency, subarachnoid hemorrhage, and hypoxia. 
It is conceivable that ET-1 produced in and released 
from the endothelial cells is bound to an ET-receptor 
in the vascular smooth muscle cells and acts as a local 
regulator in maintaining vascular tonus. It is con- 
jectured that the increase in concentration of ET-1 due 
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to the above-mentioned diseases is caused by the col- 
lapse of balance between the amount of ET-1 bound to 
the ET-receptor and the amount of ET-1 released. 

( 2 ) Presence of ET B -receptor mRNA 

Northern blot hybridization is conducted as 
described in item (1), by using a probe, 1.2 kb EcoRl 
fragment, which is derived from pHETBR34 and is radio- 
labeled, resulting in that a band with a size of 
4.3 kb and a band with a size of 1.7 kb are found in 
various tissues as shown in Figure 7. it is considered 
that the plurality of mRNAs is due to the difference in 
polyadenylation . 

It is found that mRNAs with a size of 4.3 kb 
and 1.7 kb are expressed in the human cerebral cortex 
and cerebellum at high levels and in the placenta, 
lung, kidney, adrenal, colon and duodenum at moderate 
levels . 

Example 

Hereinafter, the present invention will be 
described by way of illustrating examples,. 

(I) Sequencing of DNA encoding a human ET-receptor: 

( 1 ) Sequencing of DNA encoding a human ET A -recep- 

tor 

First, cDNA prepared from poly(A)" f RNA derived 
from a human placenta, by using oligo( dT ) -primer , was 
introduced into phage \ ZAPII, to construct a cDNA 
library ( Sambrook et al . , Molecular Cloning: A labora- 
tory Manual, Cold Spring Harbor Laboratory, New York 
(1989)). Approximately 1 x 10 6 plaques were screened 
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by using an NcoI-EcoRI fragment (960 bp) of DNA encod- 
ing a bovine ET-1 receptor as a probe (Nature, 348 , 
730-732 (1990)) in the following manner. Filters 
(Colony/Plaque Screen, du Pont, Wilmington, DE ) to 
5 which plaques were replicated were prehybridized for 

6 hours in a solution containing 1% SDS, 1 M NaCl, 10% 
dextran sulfate, 200 pg/ml of yeast tRNA and 250 pg/ml 
of denatured salmon sperm DNA . Then the filters were 
hybridized at 65°C for 18 hours with the probe 
10 ( Nco I- Eco RI fragment) labeled by random-primed synthe- 
sis to the specific activity of 5 x 10° cpm/1 pg DNA. 
The filters were then washed twice (30 min. per wash) 
in 0.2 x SSC (1 x SSC is 0.15 M NaCl, 15 mM sodium 
citrate (pH 7.0)) containing 0.1% SDS at 60 °C. The 
15 resulting filters were subjected to autoradiography in 
which the filters were overlayered with Konica enhanc- 
ing screens and Konica X-ray films (Konica, Tokyo, 
Japan) and left for 4 hours at -80°C. As a result, a 
plurality of clones which were hybridized with the 
20 probe were found. Fragments of the cDNA insert of 
phETIR were subcloned into the Bluescript plasmid 
vector (Stratagene, La Jolla, CA). Both strands ( h -) 
of the cDNA insert were sequenced by the, dideoxy chain 
termination method using Sequenase (United States 
25 Biochemical Corp., Cleveland, OH). The nucleic acid 
sequence and a deduced amino acid sequence of the human 
ET-receptor obtained from phETIR are shown in SEQ ID 
NO: 1 . 

30 (2) Sequencing of DNA encoding a human ET B -recep- 

tor 

In the same way as in item ( 1 ) , cDNA prepared 
from poly(A) + RNA derived from a human placenta, by 
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using oligo(dT)-primer, was introduced into 
phage 7LZAPII to construct a cDNA library. The ap- 
proximately 1 x 10 6 plaques produced were screened 
using the same probe used in item (1) under conditions 
5 different from those in item (1), Filters to which 

plaques were replicated were immersed in a solution 
containing 1% SDS, 1 M NaCl, 10% dextran sulfate, 
200 \xg/ml of yeast RNA and 250 pg/ml of denatured 
salmon sperm DNA, and the plaques were hybridized with 
10 the probe at 65°C for 18 hours. The filters were then 
washed twice (30 min. per wash) in 0-5 x SSC containing 
0.1% SDS at 50 °C. The resulting filters were subjected 
to autoradiography to detect positive clones. Three 
out of 20 positive clones were clones which became 
15 positive even under the highly stringent conditions of 
hybridization described in item (1) above, and there- 
fore, these three clones are cDNAs of ET A -receptors . 
Plasmids obtained from the remaining 17 clones were cut 
with appropriate restriction enzymes and were sequenced 
20 by the dideoxy chain termination. As a result, a cDNA 
sequence shown in SEQ ID NO : 2 was identified from 
pHETBR31 and pHETBR34. 

(II) Construction of an expression vector, a prepara- 
25 tion of a tr ans f ormant , and an expression of an 
ET-receptor : 

( 1 ) ET A -receptor 

A Not I fragment of the phETIR ' obtained in 
item (I) was introduced into a CDM8 (Naixire, 329 , 
30 840-842 (1987) to obtain an expression vector, 
CDM8 -phETIR . COS-7 cells maintained in Dulbecco's 
modified Eagle's medium supplemented with 100 U/ml of 
penicillin and streptomycin and fetal bovine serum 
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(Hazleton, Lenexa, KS ) were transfected with the 
CDM8-phETIR, by a calcium phosphate method. Separate- 
ly, the COS-7 cells were transfected with the control 
plasmid CDM8. Twenty micrograms of DNA per 100 mm 
plate were used for transf ection . The transfected 
cells were treated with 20% glycerol for 4 hours after 
the transfection. Four hours after the glycerol treat- 
ment, the cells were harvested from 100 mm plates and 
5 x 10 4 cells/well were plated on a 24-well cell cul- 
ture plate (Corning, Glass Co* Corning, NY), 

( 2 ) ET B -receptor 

An Xhal fragment (2.7 kb ) of the pHETBR34 
obtained in item (I) was introduced into the CDM8 to 
obtain an expression vector, CDM8-pHETBR. In the same 
way as described in item (1), this vector was intro- 
duced into a C0S-7 cell and cultured. 

(ill) Binding assay of an ET receptor produced from a 
trans formant to an ET: 

125 I-ET-1 ( 125 l-labeled ET-1) (2000 Ci/mmol) 
was purchased from Amersham (Buckinghamshire, UK). 
Unlabeled ET-1, ET-2 and ET-3 were purchaoed from 
Peptide Institute Inc. (Minoh, Japan). 

( 1 ) ET A -receptor 

Binding assays were performed for a trans- 
formant containing CDM8 - phE T I R obtained in 'item (II) in 
a 24-well cell culture plate as follows: 

Confluent cells in the wells (48 hours after 
the glycerol treatment) were washed three times with 
1 ml of Hank's balanced salt solution containing 0.1% 



bovine serum albumin (BSA) (binding medium). A solu- 
tion containing 50 pM of 125 I-ET-1 and various concen- 
trations (ICT 10 to 10" 6 M) of ET-1 was added to each 
well* Separately, a solution containing . ET-2 or ET-3 
instead of ET-1 and a solution containing 125 I-Ef-l 
alone were prepared, and were respectively added to 
each well* These solutions added to the wells were 
incubated at 37 C C for 60 min. Following three washings 
with 1 ml of ice-cold binding medium, the cells were 
dissolved in 0.5 ml of 1 N NaOH. 

The cell-bound radioactivity was measured by 

an autogamma counter (Aloka, Tokyo, Japan). The total 

binding was calculated as follows: (the radioactivity 

in the absence of unlabeled ET-1, ET-2 or ET-3) - (the 

radioactivity in the presence of 4 x 10 M unlabeled 

ET-1). All measurements were conducted twice. As a 

result, the total binding of 125 1-ET-1 was 6900 cpm 

— 7 

(background binding in the presence of 4 x 10 M ET-1 
was 150 cpm). The radioactivity in the presence of 
ET-1, ET-2, or ET-3 in various concentrations is repre- 
sented in per cent of the total binding (6900 cpm). 
The results are shown in Figure 3. It ,is understood 
from Figure 3 that the affinity of the ET-receptor 
derived from the phETIR of the present invention 
for ETs is ET-1 (IC 50 3.0 x 10" 9 M) ^ ET-2 
(1C 50 6.1 x 10~ ? ) >> ET-3 (1C 50 1 x 10" 6 M or more). 

(2) ET B -receptor * 

Binding assays were performed in the same way 
as described in item (1) using a transformant contain- 
ing the CDM8 -pHETBR instead of a transformant contain- 
ing the CDM8-phETIR. The results are shown in 
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Figure 4. In Figure 4, Q shows the radioactivity in 
the presence of ET-2; Q shows the radioactivity in 
the presence of ET-1; and shows the radioactivity 

in the presence of ET-3. It is understood from 
Figure 4 that this receptor has almost the same affini- 
ty for ET-1, ET-2 and ET-3. 

(IV) Expression of ET-receptor mRNA in a cell: 

Approximately 10 mg of mRNA was synthesized 
in vitro from phETIR by using T7RNA polymerase in the 
presence of capping nucleotides. The mRNA thus ob- 
tained was pressure-injected into oocytes of an Xenopus 
laevis with a pipette. The oocytes were then incubat- 
ed in sterile Barth's medium at 20°C for 3 days. 
Electro-physiological measurements were performed at 
20 °C in an ND9 6 solution (96 mM NaCl, 2 mM KC1, 1 mM 
MgCl 2 , 5 mM Hepes, pH 7.6). Two glass microelectrodes 
filled with 4 M potassium acetate solution were insert- 
ed into an oocyte, and the membrane potential was held 
at -60 mV. To this oocyte, 1 x 10" 7 M ET-1, ET-2, or 
ET-3 desolved in the ND 96 solution containing 0.1% 
Triton X-100 and 0.1% gelatin were applied. 

Twenty seconds after the application of the 
ET-1 solution, a large inward current was recorded from 
the oocytes under a holding potential at -60 mV. The 
chart recorded is shown in Figure 5. A similar inward 
current was recorded when 1 x 10" 7 M ET-2 was applied 
(not shown). In contrast, a much smaller current was 
recorded when 1 x 10" 7 M ET-3 was applied (Figure 5). 
The currents caused by the ETs were fluctuating and 
long-lasting, and were characteristic of Ca 2+ - activated 
chloride currents. No currents were recorded when the 
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medium alone (ND9 solution containing 0.1% Triton X-100 
and 0.1% gelatin) was applied (Figure 7). 

It is understood from the above results that 
5 the ET-receptor derived from the phETIR of the present 

invention has a higher affinity for ET-1 or ET-2 than 
for ET-3. 

(V) Presence of ET-receptor mRNA in various human 

10 tissues: 

(1) ET A -receptor 

Among the human tissues used herein, the 
cerebral cortex, cerebellum, aorta, lung, atrium, 
liver, kidney, adrenal, duodenum, colon and placenta 

15 were obtained from an autopsy or operation. These tis- 
sues were weighed, frozen in liquid nitrogen, and 
stored at -70°C until used. Human umbilical vein 
endothelial cells were purchased from Colonetics Corp 
(San Diego, CA), and cultured as described in Lab. 

20 Invest. 63, 115-122 (1990). 

Total RNA was isolated from each tissue by a 
guanidinium isocyanate/cesium chloride method. Total 
RNA was separated on 0.66 M formaldehyde- 1% agarose 

25 gels (20 pg per lane), and transferred to a nylon mem- 
brane (Pall, Glen, Cove, NY) in 20 x SSC. Blots were 
fixed by UV cross-linking and were prehybridized at 
65 °C for 12 hours in a solution containing 4 x SSC, 
10 x Denhardt's solution (1 x Denhardt f s splution is 

30 0.2% polyvinylpyrrolidone, 0.2% BSA, and 0.2% Ficoll), 
0.5% SDS, and 250 pg/ml of denatured salmon sperm DNA. 
The blots were then hybridized at 42 °C for 4 hours in a 
solution containing 50% formamide, 4 x SSC, 5 x Den- 
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hardt's solution, 0.5% SDS, 10% dextran sulfate, 
250 pg/ml of denatured salmon sperm DNA, and the 
radio-labeled Eco RV- Eco RI fragment of the insert of 
phETiR (826 bp; used as a probe). The probe was 

5 labeled by random-primed synthesis to the specific 

activity of 1 x 10^ cpm/pg DNA. The blots were washed 
twice at room temperature (30 min. per wash): once at 
60°C in a solution containing 2 x SSC and 0.1% SDS 
(30 min. per wash) and twice at 60°C in a solution 

10 containing 0.1 x SSC and 0.1% SDS (15 min. per wash). 

The resulting blots were subjected to autora- 
diography in which filters carrying blots were overlay- 
ered with Konica enhancing screens and Kodak X-Omat AR 

15 film (Kodak, Corp. Rochester, NY) and left for 3 days 
at -70 °C. The results are shown in Figure 6. A single 
band with a size of 4.3 kb is located in various tis- 
sues, suggesting that mRNAs of the ET-receptor of the 
present invention are present in various tissues. in 

20 particular, the mRNAs are present in the aorta at the 
highest levels; in the lung, atrium, colon, and placen- 
ta at high levels; and in the cerebral cortex, cerebel- 
lum, ventricle, kidney, adrenal, and duodenum at moder- 
ate levels. A hybridized band is not found in the 

25 liver and in the cultured human umbilical vein endothe- 

lial cell. 

(2) ET B -receptor 

Autoradiography was performed in ti^e same way 
30 as described in item (1) above, except that the radio- 
labeled Eco Rl fragment (1.2 kb ) of the insert of 
pHETBR34 was used as a probe instead of the radio- 
labeled EcoRV-EcoRI fragment of the insert of pHETlR. 
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The results are shown in Figure 7* As shown in 
Figure 7, bands with a size of about 4,3 kb and 1.7 kb 
are located. It is understood that the ET B -receptor 
mRNA is present in the cerebral cortex and cerebellum 
5 at high levels • In addition, unlike the ET A -receptor , 

the ET B -receptor mRNA is present in the umbilical vein 
endothelial cell. 

As described above, according to the present 

10 invention, a novel human endothelin receptor, DNA 
sequence encoding the receptor, an expression vector 
having the DNA sequence, a trans formant comprising the 
expression vector, and a method for producing a human 
endothelin receptor from the transformant are provided, 

15 The receptor shown in SEQ ID NO: 1 is an ET A -receptor 
which has an affinity for ET-1 and ET-2, especially the 
affinity for ET-1 being stronger. The receptor shown 
in SEQ ID NO: 2 is an ET B -recetor which has an affinity 
for ET-1, ET-2 and ET-3 (with no selectivity). Thus, 

20 it is the first time that both an ET A -receptor and an 
ET B -receptor are found in a specific mammal. The 
ET-receptors obtained are useful as an agent for meas- 
uring the amount of ET or useful in screening for an 
antagonist of the ET-receptors so as to study agents 

25 for the circulatory system. 

Various other modifications will be apparent 
to and can be readily made by those skilled in the art 
without departing from the scope and spirit of this 
30 invention. Accordingly, it is not intended that the 
scope of the claims appended hereto be limited to the 
description as set forth herein, but rather that the 
claims be broadly construed. 
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The following specific sequence information 
and descriptions are provided in order to comply with 
the formal requirements of the submission of sequence 
data to the United States Patent and Trademark Office 

5 and are not intended to limit the scope of what the 

inventors regard as their invention ♦ Variations in 
sequences which become apparent to those skilled in the 
art upon review of this disclosure and which are encom- 
passed by the attached claims are intended to be within 

10 the scope of the present invention . Further, it should 
be noted that efforts have been made to insure accuracy 
with respect to the specific sequences and characteris- 
tic description information describing such sequences, 
but some experimental error and/or deviation should be 

15 accounted for. 
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SEQUENCE LISTING 
GENERAL INFORMATION: 

(i) APPLICANT: SHIONOGI SEIYAKU KABUSHIKI KAISHA 

(ii) TITLE OF INVENTION: A human endothelin recep- 

tor 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SHIONOGI SEIYAKU KABUSHIKI 

KAISHA 

(B) STREET: 3-1-8, Dosho-machi 

(C) CITY: Chuo-ku 

(D) STATE: Osaka 

(E) COUNTRY: Japan 

(F) ZIP: 541 

(vii) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette. 3.50 inch. 

1024 kb storage 

(B) COMPUTER: EPSON 

(C) OPERATING SYSTEM: MS-DOS ver. 2.11 

(D) SOFTWARE: Wordstar ver. 5.0 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 3-172828* 

(B) FILING DATE: 12-July-1991 
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(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human 



(vii) DIRECT ORIGINAL SOURCE: 
(B) CLONE: phETIR 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 485 1768 

(C) IDENTIFICATION METHOD: by experiment 

(A) NAME/KEY: signal peptide 

(B) LOCATION: 485 544 

(C) IDENTIFICATION METHOD: similarity to 
other signal sequences 

(A) NAME /KEY : mat peptide 

(B) LOCATION: 545 1768 

(C) IDENTIFICATION METHOD: similarity to 
other signal sequences 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAATTCGCGG CCGCCTCTTG CGGTCCCAGA GTGGAGTGGA AGGTCTGGAG CTTTGGGAGG 60 
AGACGGGGAG GACAGACTGG AoGCGTGTTC CTCCGGAGTT TTCTTTTTCG TGCGAGCCCT 120 
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CGCGCGCGCG TACAGTCATC CCGCTGGTCT GACGATTGTG GAGAGGCGGT GGAGAGGCTT 180 

CATCCATCCC ACCCGGTCGT CGCCGGGGAT TGGGGTCCCA GCGACACCTC CCCGGGAGAA 240 

GCAGTGCCCA GGAAGTTTTC TGAAGCCGGG GAAGCTGTGC AGCCGAAGCC GCCGCCGCGC 300 

CGGAGCCCGG GACACCGGCC ACCCTCCGCG CCACCCACCC TCGCTTTCTC CGGCTTCCTC 360 

TGGCCCAGGC GCCGCGCGGA CCCGGCAGCT GTCTGCGCAC GCCGAGCTCC ACGGT GA A A A 420 

i. ^ 

' Jl AAAAG T G A A GGTGTAAAAG C AGC AC AAG T GCAATAAGAG ATATTTCCTC AAATTTGCCT 480 

ffcAAG ATG GAA ACC CTT TGC CTC AGG GCA TCC TTT TGG CTG GCA CTG GTT 529 

"Ae-fc Glu Thr Leu Cys Leu Arg Ala Ser Phe Trp Leu Ala Leu Val 
O -20 -15 -10 

PpGA TGT GTA ATC AGT GAT AAT CCT GAG AGA TAC AGC AC A AAT CTA AGC 577 

F Gly Cys Val lie Ser Asp Asn Pro Glu Arg Tyr Ser Thr Asn Leu Ser 
-5 -11 5 10 

AAT CAT GTG GAT GAT TTC ACC ACT TTT CGT GGC ACA GAG QTC AGC TTC 625 

Asn His Val Asp Asp Phe Thr Thr Phe Arg Gly Thr Glu Leu Ser Phe 
15 20 25 

CTG GTT ACC ACT CAT CAA CCC ACT AAT TTG GTC CTA CCC AGC AAT GGC 673 

Leu Val Thr Thr His Gin Pro Thr Asn Leu Val Leu Pro Ser Asn Gly 
30 35 40 

TCA ATG CAC AAC TAT TGC CCA CAG CAG ACT AAA ATT ACT TCA GCT TTC 721 
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Ser Met: His Asn 
45 

AAA TAG ATT AAC 

Lys Tyr lie Asn 
60 

QTG GGG AAT GCA 
{yal Gly Asn Ala 

! 4GG AAT GGC CCC 

fjg^rg Asn Gly Pro 
It 95 

HVTC TAT GTG GTC 

He Tyr Val Val 
110 

GGG CGC TGG CCT 

Gly Arg Trp Pro 
125 

CTG TTC CCC TTT 

Leu Phe Pro Phe 
140 

CTC TGC GCT CTT 



Tyr Cys Pro Gin 
50 

ACT GTG ATA TCT 

Thr Val He Ser 
65 

ACT CTG CTC AGG 

Thr Leu Leu Arg 
80 

AAC GCG CTG ATA 
Asn Ala Leu He 

ATT GAT CTC CCT 

lie Asp Leu Pro 
115 

TTT GAT CAC AAT 

Phe Asp His Asn 
130 

TTG CAG AAG TCC 

Leu Gin Lys Ser 
145 

AGT GTT GAC AGG 



Gin Thr Lys lie 
55 

TGT ACT ATT TTC 

Cys Thr lie Phe 
70 

ATC ATT TAC CAG 

He He Tyr Gin 
85 

GCC AGT CTT GCC 

Ala Ser Leu Ala 
100 

ATC AAT GTA TTT 
He Asn Val Phe 

GAC TTT GGC GTA 

Asp Phe Gly Val 
135 

TCG GTG GGG ATC 

Ser Val Gly He 
150 

TAC AGA GCA GTT 



Thr Ser Ala Phe 

ATC GTG GGA ATG 

He Val Gly Met 
75 

AAC AAA TGT ATG 

Asn Lys Cys Met 
90 

CTT GGA GAC CTT 

Leu Gly Asp Leu 
105 

AAG CTG CTG GCT 

Lys Leu Leu Ala 
120 

TTT CTT TGC AAG 
Phe Leu Cys Lys 

ACC GTC GTC AAC 

Thr Val Leu Asn 
155 

GCC TCC TGG AGT 
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Leu Cys Ala Leu Ser Val 

160 

CGT GTT CAG GGA ATT GGG 

Arg Val Gin Gly lie Gly 
175 

TCC ATC TGG ATC CTG TCC 

-lias, 

$&r lie Trp lie Leu Ser 

i4c GTC ATG GTA CCC TTT 
Me Val Met Val Pro Phe 

Mg CTC AAT GCC aca TCA 

Met Leu Asn Ala Thr Ser 
220 225 

GAC TGG TGG CTC TTC GGG 

Asp Trp Trp Leu Phe Gly 

240 

GCG ATC TTC TAG ACC CTC 

Ala lie Phe Tyr Thr Leu 
255 



Asp Arg Tyr Arg Ala Val 
165 

ATT CCT TTG GTA ACT GCC 

lie Pro Leu Val Thr Ala 
180 

TTT ATC CTG GCC ATT CCT 



Phe lie Leu Ala lie Pro 
195 

GAA TAT AGG GGT GAA CAG 

Glu Tyr Arg Gly Glu Gin 
210 215 

AAA TTC ATG GAG TTC TAC 

Lys Phe Met Glu Phe Tyr 

230 

TTC TAT TTC TGT ATG CCC 

Phe Tyr Phe Cys Met Pro 
245 

ATG ACT TGT GAG ATG TTG 

Met Thr Cys Glu Met Leu 
260 



Ala Ser Trp Ser 
170 

ATT GAA ATT GTC 1105 

lie Glu lie Val 
185 

GAA GCG ATT GGC 1153 

Glu Ala lie Gly 
200 

CAT AAA ACC TGT 1201 

His Lys Thr Cys 

CAA GAT GTA AAG 1249 

Gin Asp Val Lys 
235 

TTG GTG TGC ACT 1297 

Leu Val Cys Thr 
250 
* 

AAC AGA AGG AAT 1345 

Asn Arg Arg Asn 
265 
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GGG AGC TTG AGA ATT GCC 

Gly Ser Leu Arg lie Ala 
270 

GTG GCA AAA ACA GTT TTC 

Val Ala Lys Thr Val Phe 
285 

TfC CCT CTT CAC TTA AGC 

ijWe Pro Leu His Leu Ser 
i00 305 

a¥g gac aag aac cga tgt 

Si 

Haw 

IYp?t Asp Lys Asn Arg Cys 

S 320 

rSC ATC GGT ATT AAC TTG 

Tyr lie Gly lie Asn Leu 
335 

GCT CTG TAT TTT GTG AGC 

Ala Leu Tyr Phe Val Ser 
350 

CTC TGC TGC TGC TGT TAC 

Leu Cys Cys Cys Cys Tyr 
365 



CTC AGT GAA CAT CTT AAG 

Leu Ser Glu His Leu Lys 
275 

TGC TTG GTT GTA ATT TTT 

Cys Leu Val Val lie Phe 
290 295 

CGT ATA TTG AAG AAA ACT 

Arg lie Leu Lys Lys Thr 

310 

GAA TTA CTT AGT TTC TTA 

Glu Leu Leu Ser Phe Leu 
325 

GCA ACC ATG AAT TCA TGT 

Ala Thr Met Asn Ser Cys 
340 

AAG AAA TTT AAA AAT TGT 

Lys Lys Phe Lys Asn Cys 
355 

CAG TCC AAA AGT CTG ATG 

Gin Ser Lys Ser Leu Met 
370 375 



CAG CGT CGA GAA 1393 

Gin Arg Arg Glu 
280 

GCT CTT TGC TGG 1441 
Ala Leu Cys Trp 

GTG TAT AAC GAA 1489 

Val Tyr Asn Glu 

315 

CTG CTC ATG GAT 1537 

Leu Leu Met Asp 
330 

ATA AAC CCC ATA 1585 

lie Asn Pro lie 
345 

TTC CAG TCA TGC 1633 

Phe Gin Ser Cys 
360 

ACC TCG GTC CCC 1681 

Thr Ser Val Pro 
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ATG AAC GGA ACA AGC ATC CAG TGG A AG AAC CAC GAT CAA AAC AAC CAC 1729 

Met Asn Gly Thr Ser lie Gin Trp Lys Asn His Asp Gin Asn Asn His 
380 385 390 395 

AAC ACA GAC CGG AGC AGC CAT AAG GAC AGC ATG AAC TGACCACCCT 1775 

Asn Thr Asp Arg Ser Ser His Lys Asp Ser Met Asn 

400 405 

f^GAAGCACT CCTCGGTACT CCCATAATCC TCTCGGAGAA AAAAATGACA AGGCAACTGT 1835 

(IllCTCCGGGA ATCTCTTCTC TGATCCTTCT TCCTTAATTC ACTCCCACAC CCAAGAAGAA 1895 

AiGCTTTCCA AAACCGCAAG GTAGACTGGT TTATCCACCC ACAACATCTA CGAATCGTAC 1955 

CTCTTTAATT GATCTAATTT ACATATTCTG CGTGTTGTAT TCAGCACTAA AAAATGGTGG 2015 

(|&GCTGGGGG AGAATGAAGA CTGTTAAATG AAACCAGAAG GATATTTACT ACTTTTGCAT 2075 

GAAAATAGAG CTTTCAAGTA CATGGCTAGC TTTTATGGCA GTTCTGGTGA ATGTTCAATG 2135 

GGAACTGGTC ACCATGAAAC T T TAG AG AT T AACGACAAGA TTTTCTACTT TTTTTAAGTG 2195 

ATTTTTTGTC CTTCAGCCAA AC AC A AT ATG GGCTCAGGTC ACTTTTATTT GAAATGTCAT 2255 

TTGGTGCCAG TATTTTTTAA CTGCATAATA GCCTAACATG ATTATTTGAA CTTATTTACA 2315 

CATAGTTTGA AAAAAAAAAG ACAAAAATAG TATTCAGGTG AGCAATTAGA T^AGTATTTT 2375 

« 

CCACGTCACT ATTTATTTTT T T AAA AC AC A AATTCTAAAG CTACAACAAA TACTACAGGC 2435 

CCTTAAAGCA CAGTCTGATG ACACATTTGG CAGTTTAATA GATGTTACTC A A AG A A T T T T 2495 

TTAAGAACTG TATTTTATTT TTTAAATGGT GTTTTATTAC AAGGGACCTT GAACATGTTT 2555 
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TGTATGTTAA ATTCAAAAGT AATGCTTCAA TCAGATAGTT CTTTTTCACA AGTTCAATAC 2615 

TGTTTTTCAT GTAAATTTTG TATGAAAAAT CAATGTCAAG TACCAAAATG TTAATGTATG 2675 

TGTCATTTAA CTCTGCCTGA GACTTTCAGT GCACTGTATA TAGAAGTCTA AAACACACCT 2735 

AAGAGAAAAA GATCGAATTT TTCAGATGAT TCGGAAATTT TCATTCAGGT ATTTGTAATA 2795 

GTGACATATA TATGTATATA CATATCACCT CCTATTCTCT TAATTTTTGT TAAAATGTTA 2855 

si** 3 !: 

J^TGGCAGTA AGTCTTTTTT GATCATTCCC TTTTCCATAT AGGAAAGATA ATTTTGAAGT 2915 

feeCCAGATGA GTTTATCATG TCAGTGAAAA ATAATTACCC ACAAATGCCA CCAGTAACTT 2975 

Macgattctt cacttcttgg GGTTTTCAGT ATGAACCTAA CTCCCCACCC CAACATCTCC 3035 

CTCCCACATT GTCACCATTT CAAAGGGCCC ACAGTGACTT TTGCTGGGCA TTTTCCCAGA 3095 

fPTTTACAGA CTGTGAGTAC AGCAGAAAAT CTTTTACTAG TGTGTGTGTG TATATATATA 3155 

AACAATTGTA AATTTCTTTT AGCCCATTTT TCTAGACTGT CTCTGTGGAA TATATTTGTG 3215 

TGTGTGATAT ATGCATGTGT GTGATGGTAT GTATGGATTT AATCTAATQT AATAATTGTG 3275 

CCCCGCAGTT GTGCCAAAGT GCATAGTCTG AGCTAAAATC TAGGTGATTG TTCATCATGA 3335 

CAACCTGCCT CAGTCCATTT TAACCTGTAG CAACCTTCTG CATTCATAAA TCTTGTAATC 3395 

ATGTTACCAT TACAAATGGG ATATAAGAGG CAGCGTGAAA GCAGATGAGC TQTGGACTAG 3455 

CAATATAGGG TTTTGTTTGG TTGGTTGGTT TGATAAAGCA GTATTTGGGG TCATATTGTT 3515 

TCCTGTGCTG GAGCAAAAGT CATTACACTT TGAAGTATTA TATTGTTCTT ATCCTCAATT 3575 

CAATGTGGTG AT G A A AT T GC CA3GTTGTCT GATATTTCTT TCAGACTTCG CCAGACAGAT 3635 
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TGGTGATAAT AAATTAGGTA AGATAATTTG TTGGGCCATA TTTTAGGACA GGTAAAATAA 3695 

CATCAGGTTC CAGTTGCTTG AATTGCAAGG CTAAGAAGTA CTGCCCTTTT GTGTGTTAGC 3755 

AGTCAAATCT ATTATTCCAC TGGCGCATCA TATGCAGTGA TATATGCCTA TAATATAAGC 3815 

CATAGGTTCA CACCATTTTG TTTAGACAAT TGTCTTTTTT TCAAGATGCT TTGTTTCTTT 3875 

CATATGAAAA AAATGCATTT TATAAATTCA GAAAGTCATA GATTTCTGAA GGCGTCAACG 3935 

J&3CATTTTAT TTATGGACTG GTAAGTAACT GTGGTTTACT AGCAGGAATA TTTCCAATTT 3995 

STACCTTTAC TACATCTTTT CAACAAGTAA CTTTGTAGAA ATGAGCCAGA AGCCAAGGCC 4055 

STGAGTTGGC AGTGGCCCAT AAGTGTAAAA TAAAAGTTTA CAGAAACCTT 4105 

2 (2) INFORMATION FOR SEQ ID NO: 2: 

P (i) SEQUENCE CHARACTERISTICS: 

hi (A) LENGTH: 4301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 238 1566 

(C) IDENTIFICATION METHOD: by experiment 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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GAGACATTCC GGTGGGGGAC TCTGGCCAGC CCGAGCAACG TGGATCCTGA GAGCACTCCC 60 

AGGTAGGCAT TTGCCCCGGT GGGACGCCTT GCCAGAGCAG TGTGTGGCAG GCCCCCGTGG 120 

AGGATCAACA CAGTGGCTGA ACACTGGGAA GGAACTGGTA CTTGGAGTCT GGACATCTGA 180 

AACTTGGCTC TGAAACTGCG GAGCGGCCAC CGGACGCCTT CTGGAGCAGG TAGCAGC 237 

ATG CAG CCG CCT CCA AGT CTG TGC GGA CGC GCC CTG GTT GCG CTG GTT 285 

Set Gin Pro Pro Pro Ser Leu Cys Gly Arg Ala Leu Val Ala Leu Val 

j.0 i 5 10 15 

M2TT GCC TGC GGC CTG TCG CGG ATC TGG GGA GAG GAG AGA G°C TTC CCG 333 

%;? it 

s Leu Ala Cys Gly Leu Ser Arg lie Trp Gly Glu Glu Arg Gly Phe Pro 
O 20 25 30 

mCT GAC AGG GCC ACT CCG CTT TTG CAA ACC GCA GAG ATA ATG ACG CCA 381 

Pro Asp Arg Ala Thr Pro Leu Leu Gin Thr Ala Glu lie Met; Thr Pro 
35 40 45 

CCC ACT AAG ACC TTA TGG CCC AAG GGT TCC AAC GCC AGT CTG GCG CGG 429 

Pro Thr Lys Thr Leu Trp Pro Lys Gly Ser Asn Ala Ser Leu Ala Arg 
50 55 60 

TCG TTG GCA CCT GCG GAG GTG CCT AAA GGA GAC AGG ACG GCA GGA TCT 477 

Ser Leu Ala Pro Ala Glu Val Pro Lys Gly Asp Arg Thr Ala Gly Ser 
65 70 75 80 

CCG CCA CGC ACC ATC TCC CCT CCC CCG TGC CAA GGA CCC ATC GAG ATC 525 
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Pro Pro Arg Thr lie Ser Pro Pro Pro Cys Gin Gly Pro lie Glu lie 

85 90 95 

AAG GAG ACT TTC AAA TAC ATC AAC ACG GTT GTG TCC TGC CTT GTG TTC 573 

Lys Glu Thr Phe Lys Tyr lie Asn Thr Val Val Ser Cys Leu Val Phe 
100 105 110 

GTG CTG GGG ATC ATC GGG AAC TCC ACA CTT CTG AGA ATT ATC TAC AAG 621 

ifal Leu Gly lie lie Gly Asn Ser Thr Leu Leu Arg lie lie Tyr Lys 
5 115 120 125 

|AAC AAG TGC ATG CGA AAC GGT CCC AAT ATC TTG ATC GCC AGC TTG GCT 669 

JLsn Lys Cys Met Arg Asn Gly Pro Asn lie Leu lie Ala Ser Leu Ala 
O 130 135 140 

JjjETG GGA GAC CTG CTG CAC ATC GTC ATT GAC ATC CCT ATC AAT GTC TAC 717 

r Leu Gly Asp Leu Leu His lie Val lie Asp lie Pro lie Asn Val Tyr 
145 150 155 160 

AAG CTG CTG GCA GAG GAC TGG CCA TTT GGA GCT GAG ATG TGT AAG CTG 765 

Lys Leu Leu Ala Glu Asp Trp Pro Phe Gly Ala Glu Met: Cys Lys Leu 

165 170 175 

GTG CCT TTC ATA CAG AAA GCC TCC GTG GGA ATC ACT GTG CTG £GT CTA 813 

Val Pro Phe lie Gin Lys Ala Ser Val Gly lie Thr Val Leu Ser Leu 
180 185 190 

TGT GCT CTG AGT ATT GAC AGA TAT CGA GCT GTT GCT TCT TGG AGT AGA 861 
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Cys Ala Leu Ser lie Asp Arg Tyr Arg Ala Val Ala Ser Trp Ser Arg 
195 200 205 

ATT AAA GGA ATT GGG GTT CCA AAA TGG ACA GCA GTA GAA ATT GTT TTG 909 

lie Lys Gly lie Gly Val Pro Lys Trp Thr Ala Val Glu lie Val Leu 
210 215 220 

ATT TGG GTG GTC TCT GTG GTT CTG GCT GTC CCT GAA GCC ATA GGT TTT 957 

'lie Trp Val Val Ser Val Val Leu Ala Val Pro Glu Ala lie Gly Phe 
<&25 230 235 240 

h Siisr 

j GAT ATA ATT ACG ATG GAC TAC AAA GGA AGT TAT CTG CGA ATC TGC TTG 1005 

,Asp lie lie Thr Met Asp Tyr Lys Gly Ser Tyr Leu Arg lie Cys Leu 
O 245 250 255 

pDTT CAT CCC GTT CAG AAG ACA GCT TTC ATG CAG TTT TAC AAG ACA GCA 1053 

9 Leu His Pro Val Gin Lys Thr Ala Phe Met Gin Phe Tyr Lys Thr Ala 
260 265 270 

AAA GAT TGG TGG CTG TTC AGT TTC TAT TTC TGC TTG CCA TTG GCC ATC 1101 

Lys Asp Trp Trp Leu Phe Ser Phe Tyr Phe Cys Leu Pro Leu Ala lie 
275 280 285 

ACT GCA TTT TTT TAT ACA CTA ATG ACC TGT GAA ATG TTG AGA 4AG AAA 1149 



Thr Ala Phe Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Arg Lys Lys 
290 295 300 

AGT GGC ATG CAG ATT GCT TTA AAT GAT CAC CTA AAG CAG AGA CGG GAA 1197 



Ser Gly Met Gin lie Ala 
305 310 

GTG GCC AAA ACC GTC TTT 

Val Ala Lys Thr Val Phe 

325 

CTT CCC CTT CAC CTC AGC 

Aeu Pro Leu His Leu Ser 

y3 340 

l~AAT GAT CCC AAT AGA TGT 
$ Asn Asp Pro Asn Arg Cys 

S 355 

WTAT ATT GGT ATC AAC ATG 

Tyr lie Gly lie Asn Met 
370 

GCT CTG TAT TTG GTG AGC 

Ala Leu Tyr Leu Val Ser 
385 390 

TTA TGC TGC TGG TGC CAG 

Leu Cys Cys Trp Cys Gin 

405 

AAG CAG TCG TGC TTA AAG 
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Leu Asn Asp His Leu Lys 

315 

TGC CTG GTC CTT GTC TTT 

Cys Leu Val Leu Val Phe 
330 

AGG ATT CTG AAG CTC ACT 

Arg lie Leu Lys Leu Thr 
345 

GAA CTT TTG AGC TTT CTG 

Glu Leu Leu Ser Phe Leu 
360 

GCT TCA CTG AAT TCC TGC 

Ala Ser Leu Asn Ser Cys 
375 380 

AAA AGA TTC AAA AAC TGC 

Lys Arg Phe Lys Asn Cys 

395 

TCA TTT GAA GAA AAA CAG 

Ser Phe Glu Glu Lys Gin 
410 

TTC AAA GCT AAT GAT CAC 



Gin Arg Arg Glu 
320 

GCC CTC TGC TGG 1245 

Ala Leu Cys Trp 
335 

CTT TAT AAT CAG 1293 

Leu Tyr Asn Gin 
350 

TTG GTA TTG GAC 1341 

Leu Val Leu Asp 
365 

ATT AAC CCA ATT 1389 
lie Asn Pro lie 

TTT AAG TCA TGC 1437 

Phe Lys Ser Cys 
400 

TCC TTG GAG GAA 1485 

Ser Leu Glu Glu 
415 

GGA TAT GAC AAC 1533 
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Lys Gin Ser Cys Leu Lys Phe Lys Ala Asn Asp His Gly Tyr Asp Asn 
420 425 430 

TTC CGT TCC AGT AAT AAA TAC AGC TCA TCT TGAAAGAAGA ACTATTCACT 1583 

Phe Arg Ser Ser Asn Lys Tyr Ser Ser Ser 
435 440 

GTATTTCATT TTCTTTATAT TGGACCGAAG TCATTAAAAC AAAATGAAAC ATTTGCCAAA 1643 

J&CAAAACAAA AAACTATGTA TTTGCACAGC ACACTATTAA AATATTAAGT GTAATTATTT 1703 

iJaACACTCAC AGCTACATAT GACATTTTAT GAGCTGTTTA CGGCATGGAA AGAAAATCAG 1763 

CTGGGAATTAA GAAAGCCTCG TCGTGAAAGC ACTTAATTTT TTACAGTTAG CACTTCAACA 1823 

JJagctcttaa CAACTTCCAG GATATTCACA CAACACTTAG GCTTAAAAAT GAGCTCACTC 1883 

ESAGAATTTCTA TTCTTTCTAA AAAGAGATTT ATTTTTAAAT CAATGGGACT CTGATATAAA 1943 

GGAAGAATAA GTCACTGTAA AACAGAACTT TTAAATGAAG CTTAAATTAC TCAATTTAAA 2003 

ATTTTAAAAT CCTTTAAAAC AACTTTTCAA TTAATATTAT CACACTATTA TCAGATTGTA 2063 

ATTAGATGCA AATGAGAGAG CAGTTTAGTT GTTGCATTTT TCG3ACACTG GAAACATTTA 2123 

AATGATCAGG AGGGAGTAAC AGAAAGAGCA AGGCTGTTTT TGAAAATCAT TACACTTTCA 2183 

CTAGAAGCCC AAACCTCAGC ATTCTGCAAT ATGTAACCAA CATGTCACAA AGAAGCAGCA 2243 

TGTAACAGAC TGGCACATGT GCCAGCTGAA TTTAAAATAT AATACTTTTA AAAAGAAAAT 2303 

TAT T ACATCC TTTACATTCA GTTAAGATCA AACCTCACAA AGAGAAATAG AATGTTTGAA 2363 

AGGCTATCCC AAAAG ACT T T "i TTGAATCTG TCATTCACAT ACCCTGT G A A GACAATACTA 2423 
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TCTACAATTT TTTCAGGATT ATTAAAATCT TCTTTTTTCA CTATCGTAGC TTAAACTCTG 2483 

TTTGGTTTTG TCATCTGTAA ATACTTACCT ACATACACTG CATGTAGATG ATTAAATGAG 2543 

GGCAGGCCCT GTGCTCATAG CTTTACGATG GAGAGATGCC AGTGACCTCA TAATAAAGAC 2603 

TGTGAACTGC CTGGTGCAGT GTCCACATGA CAAAGGGGCA GGTAGCACCC TCTCTCACCC 2663 

ATGCTGTGGT TAAAATGGTT TCTAGCATAT GTATAATGCT ATAGTTAAAA TACTATTTTT 2723 

JpAAAATCATA CAGATTAGTA CATTTAACAG CTACCTGTAA AGCTTATTAC TAATTTTTGT 2783 

flATTATTTTTG TAAATAGCCA AT AG AAA AG T TTGCTTGACA TGGTGCTTTT CTTTCATCTA 2843 

y bAGGCAAAAC TGCTTTTTGA GACCGTAAGA ACCTCTTAGC TTTGTGCGTT CCTGCCTAAT 2903 

OrTTTATATCT TCTAAGCAAA GTGCCTTAGG ATAGCTTGGG ATGAGATGTG TGTGAAAGTA 2963 

CPtGTACAAGAG AAAACGGAAG AGAGAGGAAA TGAGGTGGGG TTGGAGGAAA CCCATGGGGA 3023 

CAGATTCCCA TTCTTAGCCT AACGTTCGTC ATTGCCTCGT CACATCAATG CAAAAGGTCC 3083 

TGATTTTGTT CCAGCAAAAC ACAGTGCAAT GTTCTCAGAG TGACTTTCGA AATAAATTGG 3143 

GCCCAAGAGC TTTAACTCGG TCTTAAAATA TGCCCAAATT TTTACTTTGT TTTTCTTTTA 3203 

AT AGGCTGGG CCACATGTTG G A A AT A AGCT AGTAATGTTG TTTTCTGTCA ATATTGAATG 3263 

TGATGGTACA GTAAACCAAA ACCCAACAAT GTGGCCAGAA AGAAAGAGCA A'BAATAATTA 3323 

ATTCACACAC CAT ATGGATT CTATTTATAA ATCACCCACA AACTTGTTCT TTAATTTCAT 3383 

CCCAATCACT TTTTCAGAGG CCTGTTATCA TAGAAGTCAT TTTAGACTCT CAATTTTAAA 3443 

TTAATTTTGA ATCACTAATA TTTTCACAGT TTATTAATAT ATTTAATTTC TATTTAAATT 3503 
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TTAGATTATT TTTATTACCA TGTACTGAAT TTTTACATCC TGATACCCTT TCCTTCTCCA 3563 

TGTCAGTATC ATGTTCTCTA ATTATCTTGC CAAATTTTGA AACTACACAC AAAAAGCATA 3623 

CTTGCATTAT TTATAATAAA ATTGCATTCA GTGGCTTTTT AAAAAAAATG TTTGATTCAA 3683 

AACTTTAACA TACTGATAAG TAAGAAACAA TTATAATTTC T T T AC AT ACT CAAAACCAAG 3743 

ATAGAAAAAG GTGCTATCGT TCAACTTCAA AACATGTTTC CTAGTATTAA GGACTTTAAT 3803 

J^TAGCAACAG ACAAAATTAT TGTTAACATG GATGTTACAG CTCAAAAGAT TTATAAAAGA 3863 

J^TTTAACCTA TTTTCTCCCT TATTATCCAC TGCTAATGTG GATGTATGTT C A AAC ACCT T 3923 

UTTAGTATTGA TAGCTTACAT ATGGCCAAAG GAATACA3TT TATAGCAAAA CATGGGTATG 3983 

ODTGTAGCTAA CTTTATAAAA GTGTAATATA ACAATGTAAA AAATTATATA TCTGGGAGGA 4043 

ffTTTTTTGGTT GCCTAAAGTG GCTATAGTTA CTGATTTTTT ATTATGTAAG CAAAACCAAT 4103 

" A A A A A T T T A A GTTTTTTTAA CAACTACCTT ATTTTTCACT GTACAGACAC TAATTCATTA 4163 

AATACTAATT GATTGTTTAA AAGAAATATA AATGTGACAA GTGGACATTA TTTATGTTAA 4223 

ATATACAATT ATCAAGCAAG TATGAAGTTA TTCAATTAAA ATGCCACATT TCTGGTCTCT 4283 

GGG AAA AAA A AAAAAAAA 4301 



